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Abstract 

If each edge (u, v) of a graph G = (V^, E) is decorated with a permutation ttu.v of k objects, we say 
that it has a permuted k-coloring if there is a coloring a : V {1, . . . , fc} such that cr(«) 7^ 'Jiu,v{(y{u)) 
for all (u, v) G E. Based on arguments from statistical physics, we conjecture that the threshold d^, for 
permuted fc-colorability in random graphs G{n,m — dn/2), where the permutations on the edges are 
uniformly random, is equal to the threshold for standard graph fc-colorability. The additional symmetry 
provided by random permutations makes it easier to prove bounds on dk . By applying the second moment 
method with these additional symmetries, and applying the first moment method to a random variable 
that depends on the number of available colors at each vertex, we bound the threshold within an additive 
constant. Specifically, we show that for any constant e > 0, for sufficiently large k we have 

2fc In fc - In fc - 2 - e < dfc < 2fc In fc - In fc - 1 + e . 

In contrast, the best known bounds on dk for standard fc-colorability leave an additive gap of about In fc 
between the upper and lower bounds. 

1 Introduction 

We consider random graphs G(n, m) with n vertices and m edges chosen uniformly without replacement. 
We give each edge (u, v) an arbitrary orientation, and then associate it with a uniformly random permu- 
tation TTu,v G Sk, where Sk denotes the group of permutations of k objects. A permuted k-coloring of 
this decorated graph is a function a : V ^ {1, . . . , fc} such that o-{v) 7^ 'itu,v{o-{u)) for all edges {u,v). 
For convenience we will sometimes reverse the orientation of an edge, and write n^^u = tTu.Jj when it and 
V are distinct. 

We conjecture that there is a sharp threshold for the existence of such a coloring in terms of the 
average degree d — 2m/ n: 

Conjecture 1. For each fc > 3 there is a constant dk such that 



lim Pr [G{n, m = dn/2) has a permuted fc-coloring] 



1 d < dk 
d> dk 



where the probability space includes both the graph G{n,m) and the set of permutations {7r„,„}. More- 
over, we conjecture that dk is also the threshold for standard graph fc-colorability, which is the special 
case where Tru,v is the identity permutation for all u,v: 
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Conjecture 2. For the same dt as in Conjecture [T] 

lim Pr [G{n, m — dn/2) has a standard fc-coloring] = < < fc 

[0 d>dk, 

Why might these two thresholds be the same? First note that if G is a tree, we can "unwind" the 
permutations on the edges, changing them all to the identity, by permuting the colors at each vertex. For 
any set of permutations {7r„,„}, this gives a one-to-one map from permuted colorings to standard colorings. 
Since sparse random graphs are locally treelike, for almost all vertices v we can do this transformation on 
a neighborhood of radius 0(logn) around v. The effect on v's neighborhood of the other vertices' colors 
is then the same as it would be in standard graph coloring, except that the colors on the boundary are 
randomly permuted. 

In particular, suppose we choose a uniformly random coloring of a tree, erase the colors in its interior, 
and choose a new uniformly random coloring with the same boundary conditions. The reconstruction 
threshold is the degree d above which this new coloring retains a significant amount of information about 
the original coloring [3 [21], and it is closely related to the clustering transition [19]. Since we can unwind 
the permutations on a tree, permuted fc-colorability and standard fc-colorability trivially have the same 
reconstruction threshold. 

For another argument, consider the following alternate way to choose the permutations on the edges. 
First choose a uniformly random permutation 7r„ at each vertex u. Then, on each edge {u,v), let 
TTu,!, = TTu^-Kv. (Algebraically, 7ru,„ : E ^ Sk is, the coboundary of 7r„ : V" — >■ Sk-) Since c(u) — ■Ku,v(c{v)) 
if and only if 7r„(c(D)) = 7r„(c(ii)), a local change of variables again gives a one-to-one map from permuted 
colorings to standard ones. Now note that choosing 7r„,„ in this way yields a uniform joint distribution on 
any set of edges that does not include a cycle; for instance, on a graph of girth g the ■Ku,v are {g— l)-wise 
independent and uniform. Since most loops in a sparse random graph are long, we might hope that this 
distribution on the {tiu.v} is the same, for all practical purposes, as the uniform distribution. 

Finally, perhaps the most convincing argument for Conjecture[2]comes from statistical physics. Using 
cavity field equations to analyze the asymptotic behavior of message-passing algorithms such as belief 
propagation and survey propagation, we can derive thresholds for satisfiability or colorability [171 1161 , 
as well as other thresholds such as clustering, condensation, and freezing [231 113j. However, assuming 
that there is an equal density of vertices of each color, the cavity field equations for fe-coloring in a 
random graph are identical to those for permuted fc-coloring [141 122[ : they simply express the fact that 
each edge (u, v) forbids u from taking a single color that depends on the color of v. 

Thus if the physics picture is correct — and parts of it have been shown rigorously (e.g. [T][21[I0]) — 
then colorings and permuted colorings have the same "thermodynamics" on sparse random graphs, and 
hence the same thresholds for colorability, as well as for clustering, condensation, and freezing. 

Conjecture [2] is attractive because it is easier, given current methods, to prove tight bounds on the 
threshold for permuted fc-coloring than it is for standard fc-coloring. First we recall a simple upper bound. 
Let X denote the number of permuted fc-colorings. Since the permutations are chosen independently, 
the probability that any given coloring g is proper is (1 — l/fc)™. (Indeed, this is true of any multigraph 
with m edges.) Thus the expected number of colorings is 

E[X] = fc"(l - l/fc)™ = [fc(l - l/kf'^Y ■ (1) 

This is exponentially small if k{l — l/fc)'*"'^ < 1, in which case X — Q with high probability by Markov's 
inequality. Thus 

dk < , <2fclnfc-lnfc. (2) 

— ln(l — 1/k) 

Using the second moment method, we will prove a lower bound on dk that is an additive constant 
below this upper bound. We also improve the upper bound, using a random variable that depends on the 
number of available colors at each vertex. Our results show that, for any constant e > and k sufficiently 
large, 

2fc In fc - In fc - 2 - e < dfe < 2fe In fc - In fc - 1 -I- e . 

In contrast, the best known lower bound on the threshold for standard fc-colorability is roughly Infc below 
the first moment upper bound. 
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To simplify our arguments, we work in a modified random graph model G{n, m) where the m edges are 
chosen uniformly with replacement, and the endpoints of each edge are chosen uniformly with replacement 
from the n vertices. As a consequence, both self-loops and multiple edges occur with nonzero probability. 
Note that, unlike standard fc-colorability, a self-loop at a vertex v does not necessarily render the graph 
uncolorable: it simply means that cr(«) cannot be a fixed point of the permutation on the loop, i.e., 
a{v) / ■7r„,„((T(u)). 

However, if 7r„,„ is the identity then coloring is impossible, and this occurs with probability l/k\. 
Similarly, if u and v have k edges between them, then with constant probability the permutations on 
these edges make a coloring impossible. As a consequence, the probability that G{n, m) with random 
permutations has a permuted fc-coloring is bounded below 1. 

Our bounds proceed by showing that G(n,m) is permuted-fc-colorable with probability 0(1) if d is 
sufficiently small, and is not permuted-fc-colorable with high probability if d is sufficiently large. In the 
sparse case m = 0(n), G{n,m) is simple with probability in which case it coincides with G{n,m). 

Thus, assuming that Conjecture [T] is true, these values of d are bounds on the threshold d^ for G{n,m). 

The rest of the paper is organized as follows. In Section [2] we give our second moment lower bound. 
In Section [3] we give our upper bound, which uses a random variable that depends on the number of 
available colors. In Sectional we prove an isoperimetric inequality relevant to this random variable. We 
defer the parts of our proofs that are "mere calculus" to Section [S] 



2 The second moment lower bound 

As in the simple first moment upper bound, let X denote the number of permuted fc-colorings of a random 
multigraph G{n, m) with uniformly random permutations on its edges. The Cauchy-Schwarz inequality 
applied to X ■ l{x>o} gives 

Our goal is to show that E[X^]/E[X]^ — 0(1), so that a permuted coloring exists with probability 

for a certain value of d. Assuming Conjecture [T] i.e., that a threshold dk exists, any such d is a lower 

bound on dk. 

Computing the second moment E[X^] requires us to sum, over all pairs of colorings a, r, the prob- 
ability P(<T, t) that both a and r are proper. Since the edges of G and their permutations are chosen 
independently, we have 

P{a,T) =p{a,T)"' , 

where p{a, t) is the probability that a random edge (u, v), with a random permutation tt, is satisfied by 
both colorings. That is, 

p{(J,t) = Pr [ij(u) 7^ 7r(cr(«)) and t{u) 7^ 7r(r(u))] . 

For random constraint satisfaction problems where each variable takes one of two values, such as fc-SAT 
or hypergraph 2-coloring [31 [HIT], p{g, t) is a function p(^) just of the overlap between a and r, i.e., the 
fraction of variables on which they agree. The second moment can then be bounded by maximizing a 
function of which is typically a simple calculus problem. 

For pairs of fc-colorings, however, p{a, r) depends on a fc-by-fc matrix of overlaps, where C^i^j is the 
fraction of vertices v such that cr(ii) = i and r(u) = j. Computing the second moment then requires us to 
bound a function of roughly fc^ variables, a difficult high-dimensional maximization problem. Achlioptas 
and Naor ^6^ used convexity arguments to bound this function on the Birkhoff polytope, showing that 

dfe > 2(fc - 1) ln(fc - 1) 2fc In fc - 2 In fc . 

This leaves an additive gap of about Infc between the upper and lower bounds. Note, however, that 
this bound is tight enough to determine, almost surely, the chromatic number x{G) as a function of the 
average degree to one of two possible integers, namely fc or fc-|-l where fc is the smallest integer such that 
2fclnfc > d. Achlioptas and Moore extended these arguments to random regular graphs [5], determining 
X(G') as a function of d to fc, fc -I- 1, or fc -I- 2. 

For permuted colorings, the second moment calculation is much easier. The random permutations 
create additional local symmetries, making p{a, r) a function only of the fraction on which the two 
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colorings agree. Thus we just have to maximize a function of a single variable. As a consequence, we can 
prove a lower bound on dk that matches the upper bound within an additive constant. 

Theorem 3. For any £ > 0, for sufficiently large k we have 

dfc > 2it In - In fc - 2 - e . (3) 

Proof. To compute the second moment, we sum over all fc"(^)(fc — pairs of colorings that agree at 

z of the n vertices. We say that such a pair has overlap C, = z/n. Then 

n 

z 



where p(^) is the probability that a random edge, with a random permutation, is satisfied by both 
colorings. Inclusion-exclusion gives 



Note that ^ 

P(l/fe)= (l-i' 

corresponding to the fact that two independently random colorings typically have overlap C, — l/fc + o(I). 
We proceed as in [3]. Approximating the sum (Q with an integral and using ([T]) gives 



2 . . 

z=0 



E[X] 

where ~ hides multiplicative constants, where 



^t^. Jo 



m = HO + (1 - C) ln(fc - 1) - In fc + ^ In . ^^^^ 



2 (l-l/fc)2' 

and where = — ^Ini^ ~ (1 ^ C) ~ C) is the entropy function. Applying Laplace's method to the 
integral ([Sj then gives 

W ~ V!'^"(C-x)| ' 

where (^max = argmax^gjg i]<^(C) is the global maximum of assuming that it is unique and that 

</>"(Cmax) < 0. 

We have (/>(l/fc) = 0, so if Cmax = 1/k and <j>"{l/k) < then E[X^]/E[Xf = 0(1) and Pr[X > 0] = 
f2(l). The proof is then completed by the following lemma: 

Lemma 4. For any constant e > 0, if d = 2A; In fc — In A; — 2 — £ and k is sufficiently large, (p'^l/k) < 
and (^(C) < for aU Cj^l/k. 

We defer the proof of this lemma to Section [5] □ 

3 An improved first moment upper bound 

In this section we apply the first moment method to a weighted random variable, and improve the upper 
bound ((2]) on by a constant. Specifically, we will prove the following theorem: 

Theorem 5. For any e > 0, for sufficiently large k we have 

dfc < 2fc In fc - In A: - 1 + e . (6) 
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We define our random variable as follows. Every coloring (proper or not) of n vertices with k colors 
is an element of [k]" where [k] — {1,2, ... ,k}. Thus the set of colorings is an n-cube of side k, with 
a dimension for each vertex. The set of proper colorings is some subset of this cube, S C [fc]". The 
classic first moment argument we reviewed above computes the expected number of proper (permuted) 
colorings, X = 15*1. Here we define a new random variable, where each proper coloring is given a weight 
that depends on the "degree of freedom" at each vertex, i.e., the number of colors that vertex could take 
if the colors of all other vertices stayed fixed. 

For any proper coloring a, for each vertex v, let c{a, v) denote the number of colors available for v if 
its neighbors are colored according to a. That is, 

c{a, v) = k- \{'Ku,v{(y{u)) I (u, v) G E}\ . 

Note that if there are multiple edges between u and v, they can each forbid v from taking a color. If v 
has a self-loop, we think of it as denying a color to itself, in each direction: 

c{rj,v) = fc — \{%u,v{(t{u)) I {u,v) £ E,u^v}U {7rj,,j,(cr(u)), 7r~J,(cr(ii))| . (7) 

If (T is a proper coloring, we have cia, v) >1 for all v, since w's current color (j{v) is available. Let 

^ I n„ (f if o" is proper 
1 otherwise , 

and let 

z= 

Why this random variable? The expected number of colorings E[X] can be exponentially large, even 
above the threshold where Pr[X > 0] is exponentially small. But close to the threshold, solutions come in 
clusters, where some vertices are free to fiip back and forth between several available colors. In a cartoon 
where each cluster is literally a subcube of [fc]", a cluster containing a coloring a contributes Y\.v ^('^^^) 
to X, but only 1 to Z. Thus, roughly speaking, Z counts the number of clusters rather than the number 
of colorings. Since the sizes of the clusters vary, Z has smaller fluctuations than X does, and hence gives 
a tighter upper bound on dk . We note that "cluster counting" random variables of other sorts have been 
studied elsewhere, such as satisfying assignments with a typical fraction of free variables :9] and certain 
kinds of partial assignments [15j . 

In this same cartoon where clusters are subcubes, Z also counts the number of locally maximal 
colorings, i.e., those colorings where no vertex v can be flipped to a "higher" color q > (j{v), since such 
colorings correspond to the highest corner of the cluster. Bounds on da were derived by counting locally 
maximal 3-colorings in [2] 1121 [TT] . culminating in ds < 4.937. The bounds we derive below are slightly 
weaker for fc = 3, yielding da < 5.011, since we treat the degrees of the vertices as independent rather 
than conditioning on the degree distribution. Nevertheless, computing W,[Z] gives a considerably simpler 
argument for general k. 

Clearly Z > if and only if S 7^ 0. However, in Section |4] we prove the following: 

Lemma 6. If 5 7^ then Z >1. 
Applying Markov's inequality, we see that 

Pr [G has a permuted fc-coloring] ^Vr[Z >1]< E[Z] , 

and any d such that E[Z] < 1 is an upper bound on the threshold d^. Thus we will prove Theorem [S] by 
computing E[Z]. 

Given the symmetry provided by the random edge permutations, the expected weight E[w(cr)] of any 
given coloring is independent of a. By linearity of expectation and the fact that any given a is proper 
with probability (1 — l/fc)™, we then have 

E[Z] = fc" E[w(cr)] = fc"(l - l/fc)" E[w(a) I a proper] . 

Thus we are interested in the conditional expectation 

(J proper . 



'E[w{a) I a proper] = E 



n 



c(cr, v) 
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Lemma 7. Let cr be a permuted fc-coloring. For any vertex v, the conditional distribution of the number 
of available colors c{a, v) is a function only of w's degree. In particular it does not depend on cr. 



Proof. For each neighbor u of v, there is a uniformly random permutation vr = ttu 
from having the color 7r(CT(u)). Since a is proper, we know that 7r(o-(ii)) 7^ Subject to this condition, 

TT is uniformly random among the permutations such that n{a{u)) 7^ ^(17), and thus n{o{u)) is uniformly 
random among the colors other than (t{v). In particular, it does not depend on a{u). 

We can think of the forbidden colors h{(t{u)) as balls, and the colors other than (j{v) as bins. We 
toss degu balls independently and uniformly into these fc — 1 bins, one for each edge (u, v). Then c(v) is 
the number of empty bins, plus one for (t(w). □ 

Lemma 8. Let ct be a permuted fc-coloring, {u,v) an edge, and n = ttu.v the associated random 
permutation. Then Tr{a{u)) and -K^^{a{v)) are independent, and are uniform over [k]~a{v) and [k] — a{u) 
respectively. 

Proof. Since a is proper, the conditional distribution of tt is uniform among all permutations such that 
7r((T(w)) ^ o'iv) and 7r~'^(cr(v)) 7^ a{u). For any pair of colors q,q' with q 7^ cr{v) and q' 7^ o"('u), there 
are exactly (fc — 2)! permutations tt such that -K{a{u)) = g and -K~^{a{v)) — q . Thus all such pairs (g, g') 
are equally likely, and the pair (7r(CT(u)), 7r~"'^((j(?;))) is uniform in ([fc] — c'"(u)) x ([fc] — o"(m)). □ 

Note that Lemmas [7] and |8] apply even to self-loops. That is, if tt = vr^,^ is uniformly random, then 
7r(cr(u)) and 7r^^((7(i')) are independent and uniform in [fc] — cr(u). Thus a self-loop corresponds to two 
balls, each of which can forbid a color. Since a self-loop increases «'s degree by 2, it has the same effect 
as two edges incident to v would have. Indeed, this is why we defined c((t, «) as in ([T]). 

Now let {degflu G V^} denote the degree sequence of G. By Lemmas [7] and [S] the numbers of 
available colors at the vertices c((t, v) are conditionally independent if their degrees are fixed. Thus 



E[i(;((t) I a proper, {degi;}] = 

V 



c(cr, v) 



c{a,v) 



a proper, {deg v} 
a proper, deg« 



Q{degv,k,c) 



where Q{degv, k, c) denotes the probability that v has c available colors if it has degu; that is, the 
probability that if we toss deg v balls into fc — 1 bins, then c — 1 bins will be empty. Thus 

EKa)|aproper]= E TjyQi^^Sv,k,c) 

V c—1 

where the expectation is taken over the distribution of degree sequences in G{n,m). 

The degree of any particular vertex in G{n,m = dn/2) is asymptotically Poisson with mean d. The 
degrees of different vertices are almost independent, as the next lemma shows. 

Lemma 9. The joint probability distribution of the degree sequence of G{n,m = dn/2) is the same as 
that of n independent Poisson random variables of mean d, conditioned on their sum being 2m. 

Proof. We can generate G{n,m — dn/2) as follows. There are n bins, one for each vertex. We throw 2m 
balls uniformly and independently into the bins, and pair up consecutive balls to define the edges of the 
graph. The degree of each vertex is the number of balls in the corresponding bin. The joint distribution 
of these occupancies is the product of n independent Poisson distributions with mean d, conditioned on 
the total number of balls being 2m; see e.g. |18l Theorem 5.6]. □ 
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The sum of n independent Poisson variables of mean d equals its mean nd — 2m with probability 
0{l/y/rn) = 0(1/ •^n). Conditioning on an event that holds with probability P increases the expectation 
by at most 1/P, so 

EKa) la proper] = E TT y W^g^^M 

V C— 1 

/ k \ ^ 

\^dcg„/-^ c y 

where in the second line deg v is Poisson with mean d. Since our goal is to show that E[Z] is exponentially 
small, the ^/n factor will be negligible. 

Now consider a balls and bins process with k — 1 bins. If the total number of balls is Poisson with 
mean d, then the number of balls in each bin is Poisson with mean d/{k — 1), and these are independent. 
The probability that any given bin is empty, i.e., that any given color q 7^ (^{v) is available, is 

^ ^ g-d/{ft-l) ^ 

The number of empty bins is binomially distributed as Bin(A: — ^,r), so 

Q{degv,k,c) = | r"'^ (1 - r)' 

and 

Q{degv,k,c) 1 / fc — 1 



deg V ^ C ^ C I C — 1 ' 

c— 1 c— 1 



kr 



Putting everything together, we have 



E[Z] = fc" ( 1 - 7 ) E[w(cr) 1 a proper] 



^o(VTi)."(i-i)" (^(i-(i-)^))" 



Taking the logarithm and dividing by n, in which case we can ignore the polynomial term ^/n, yields the 
following function of d: 



If /(d) < then E[Z] is exponentially small, so any such d is an upper bound on d^. We prove the 
following lemma in Section [5] 

Lemma 10. For any constant e > 0, if d = 2fc In fc — In fc — 1 + e and k is sufficiently large, then /(d) < 0. 
This completes the proof of Theorem [S] 



7 



4 An isoperimetric inequality 



In this section we prove Lemma[Bl It has nothing to do with colorings; it is simply a kind of isoperimetric 
inequality that applies to any subset of [A:]". If fc = 2, it is the classic isoperimetric inequality on the 
Boolean n-cube. That is, given S C {0, 1}", for each cr G S let d{a) be the set of neighbors a' £ S that 
differ from ct on a single bit. Then 

^2-l«(^'l > 1 
o-es 

if and only if 5 7^ 0. 

First, some notation. Let V — {1, . . . , n}, and think of each element of [k]" as a function a : V ^ [k]. 
Let S C [fc]". For each a £ S and 1 < « < n, let ds{o',v) denote the set of elements of S that are 
"neighbors of a along the v axis," i.e., that agree with a everywhere other than at v. That is, 

i9s(cr, v) = {a' £ S \ ^ V : a'{u) — o-(u)} . 

Let cs((T, v) denote the number of such neighbors, 

cs{a,v) = \dsicr,v)\ , 

and define the weight function ws as follows: 



w;s(cr) = 



njl/cs(<T,«)) iia£S 
iiafl S. 



Then define the weight of the entire set as 

Z{S)= Yl ^^sia). 

<TS[fc]" 

Lemma 11. If S' / then Z{S) > 1. 

Proof. If S = [k]", then ws{^) = l/k" and Z{S) = 1. Thus our goal will to enlarge S until S = [fc]", 
showing that Z{S) can only decrease at each step. For a given a and v, let Cyl„((T) denote the set of 
T £ [fc]" that we can obtain by letting (j{v) vary arbitrarily: 

Cyl„(cr) = {t e [fc]" 1 t{u) = g{u) for all u ^ v} . 

In particular, Cyl„(a) = Cyl„(cr') if and only if o' £ 9s(o", f)- Similarly, let Cyl„(S') be the "thickening" 
of S along the v axis, 

Cyl„(5)= U Cyl„H. 

CTgS 

We claim that this thickening can only decrease Z. That is, for any S 7^ and any v, 

Z{S) > Z(Cyl,„ S) . 

To see this, let T — Cyl„(S'). Each a £ S contributes c„((t, v) times to the union Uaes Cyl„(o"), so 

^(^) = E -T-^ »T(CylJa)) . (10) 

Since each a £ T has ct(o", i;) — fc, each r £ Cyl„((T) has ct{t,u) > cs{(^,u) for all u ^ v, and 
|Cyl„(cr)[ — fc, we have 

WT(Cyl„(a)) < fc — ws{cj) = cs{o, v) ws{o) . 

Combining this with H10|) gives 

^(^) = E ^T(Cyl„(a)) < ws{a) = . 

To complete the proof, let To = 5", and for each 1 < u < n let T„ = Cyl„(r„_i). Then T„ = [fc]", and 

Z{S) = Z{To) > Z(Ti) > • ■ • > z(r„) = 1 . □ 
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Proof of Lemma\^ Let S be the set of permuted fc-colorings. The number of available colors c[a, v) we 
defined in Section [3] is almost identical to cs{o',v) as defined in Lemma [TT] The only difference is that 
in Section 131 if v has a self-loop then it forbids two of its own colors, namely 7r„.i,(cr(u)) and ■k~^{v). 
Removing these self-loops can only increase c((j, v) and thus decrease so if 5 7^ then Z > 1 hy 
Lemma [TT] □ 



5 A little calculus 

Proof of Lemma^ Recall that 



<^(C) = HO + (1 - C) ln(fc - 1) - In fc + ^ In . ^^^^ 



2 (l-l/fc)2 
We upper bound with a simpler function. We have 

p(C) ^_ (fcC-i)^ ^ 1 



(1 - (fc- 1)3 - fe - 1 ' 

and if —1 < a; < 1/(A; — 1) the third-order Taylor series gives 



Therefore, we have (j>iO ^ V'(C) where 

m = HO + (1 - ln(fc - 1) - In fe + ^ / " 



2 \{k~l) 
This upper bound is tight at = 1/fc, where 

V'(i/fe) = <^(i/fe) = 0. 

We remark that using the first-order Taylor series ln(l + x) < x, or equivalently £{x) = x, yields a weaker 
lower bound on dk, about Infc below the first moment upper bound. 

Except for the entropy function, the dependence of on ( is polynomial, making its derivatives 
significantly simpler than those of (j>- First we note that 

r{i/k)^j^^ {d-ik-if) . 

Thus if d < (fe — 1)^, we have 

(t)"{l/k) < i>"il/k) < 0. 
Next, if fe > 2 then the fourth derivative of tp{0 is negative throughout the unit interval, 

As a consequence, has at most two local maxima in the unit interval, one of which is at = 1/fc. 

Our goal is to locate the other local maximum, which we denote (2, and to show that i/'(C2) < 0. 
Assume that d is 2 + e below the first moment upper bound for some constant e, 

d = 2felnfe-lnfe-2-e, 

and set 



- ^ - fe 



for a constant a. Then using Taylor series gives 



lna-21nfc-hdQ + i^j + 0(l/fe) 

\na + (1 - 2a)^^ + 0(l/k) . 
fe 
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For any constant a 7^ 0, for sufficiently large k tliis is positive if a > 1 and negative if a < 1. Therefore, 
("2 = 1 — a/k where a tends to 1 as A; — >■ 00, roughly as a ~ fc^''*. 

Finally, again setting d = 2fclnfc — Infc — 2 — e, some more Taylor series give 

V'(l ~ a/k) = h{a/k) + y ln(fc - 1) - In fc + - ^ " ^ " 



fc ' ' 2 \ (k-l) 

= - (a-alna + 2a\nk) + 0(l/k^) - In fc + - ( i + ^—4^ + 0(l/k^) 
k 2 \k 2k^ 

e ^ / In fc 
^-2I + ^(t^ 

where we used a — alna < 1 for all a > 0. Thus for any constants a > and £ > 0, when k is sufficiently 
large — a/k) is negative, showing that t/'(C) < in the vicinity of C,2- This completes the proof. □ 

Proof of Lemma Udl Recall that 

5 >»(' 4)+!^ + ■»('- 

Setting 

d = 2fclnfc-lnfe-l + e, 

we have 

Taking the binomial series to second order gives 

^ 1 _ fce-''/('=-i) + ^^^^ e-^'i/C'-^) + o{k-') . 
Plugging this into the last term of /(d) and using the Taylor series for ln(l — x) gives 

In (1 - (1 - e-'*/"=-^))') = In (^ke-"^^'-'^ - ^-^^/i^-^) + 0{k-')^ 

= In fc - + In f 1 - i^-il e-'*/('=-i) + 0(fc-= 

K — 1 \ 2 

= In fc - ^ - e-<^/(^-i) + o(fe-^) 



and plugging this back in to /(d) gives 



/(d) = ^ln( 1-^1 +lnfc-^+0(fc-^logfc). (11) 



The second order Taylor series for ln(l — x) gives 



= - In it + + 0(fc~^ log k) . 

2k 



Finally, putting this in ([TT} gives 
For any constant e > 0, this is negative for sufficiently large k, completing the proof. □ 



f{d) = -^ + 0{k-'logk). 
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